Using homology relations within a database markedly boosts protein sequence similarity search



Similar resources

Using homology relations within a database markedly boosts protein sequence similarity search.

Inference of homology from protein sequences provides an essential tool for analyzing protein structure, function, and evolution. Current sequence-based homology search methods are still unable to detect many similarities evident from protein spatial structures. In computer science a search engine can be improved by considering networks of known relationships within the search database. Here, w...


Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment

MOTIVATION: Genome sequencing projects require the periodic application of analysis tools that can classify and multiply align related protein sequence domains. Full automation of this task requires an efficient integration of similarity and alignment techniques. RESULTS: We have developed a fully automated process that classifies entire protein sequence databases, resulting in alignment of the...


SW#db: GPU-Accelerated Exact Sequence Similarity Database Search

In recent years we have witnessed growth in sequencing yield and in the number of samples sequenced and, as a result, growth of publicly maintained sequence databases. This increase in data has placed high demands on protein similarity search algorithms, which face two opposing goals: keeping running times acceptable while maintaining a high enough level of sensitivity....


Similarity Search Using Pre-Search in UniRef100 Database

Sequence similarity in biological databases is used to characterize a newly discovered protein and to confirm the existence of its homologs. This is often computationally very expensive. We have implemented a new algorithm that performs sequence similarity search using a pre-search phase. The proposed algorithm works in three phases. In preparation for Pre-Search, we locate a sequence, sim...


Sequence homology within the morbilliviruses.

Double-stranded cDNA synthesized from total polyadenylate-containing mRNA extracted from monkey kidney cells infected with canine distemper virus (CDV) was cloned into the PstI site of Escherichia coli plasmid pBR322. Clones containing CDV DNA were identified by hybridization to a CDV-specific 32P-labeled cDNA. A cDNA clone containing an insert of 1,700 base pairs (CDV 364) has been identified as ...



Journal

Journal title: Proceedings of the National Academy of Sciences

Year: 2015

ISSN: 0027-8424,1091-6490

DOI: 10.1073/pnas.1424324112